Prototype based machine learning for clinical proteomics
Abstract
Clinical proteomics opens the way to new insights into many diseases at a level of detail not available before. One of the most promising measurement techniques supporting this approach is mass spectrometry. The analysis of the high-dimensional data obtained from mass spectrometry calls for sophisticated, problem-adequate preprocessing and data analysis approaches. Ideally, automatic analysis tools provide insight into their behavior and make it possible to extract further information relevant for understanding the clinical data or for applications such as biomarker discovery. Prototype-based algorithms constitute efficient, intuitive and powerful machine learning methods which are well suited to high-dimensional data and which allow good insight into their behavior by means of prototypical data locations. They have already been applied successfully to various problems in bioinformatics. The goal of this thesis is to extend prototype-based methods so that they become suitable machine learning tools for typical problems in clinical proteomics. To achieve better adapted classification borders, tailored to the specific data distributions which occur in clinical proteomics, the prototype-based algorithms are extended by local relevance determination and other problem-specific metrics. Fuzzy classification is introduced into prototype approaches to allow for the integration of uncertain class label information and to make it possible to judge the reliability of a classification, as is typically needed in clinical domains. Furthermore, margin-based active learning is developed to achieve faster training and better generalization ability, which suits the often complex classification problems in clinical research.
All algorithms are extensively tested on several clinical data sets in the context of cancer research, including publicly available benchmark data sets as well as recent clinical data sets obtained by state-of-the-art biotechnology.
Similar resources
Analysis of Spectral Data in Clinical Proteomics by Use of Learning Vector Quantizers
Clinical proteomics based on mass spectrometry has gained tremendous visibility in the scientific and clinical community. Machine learning methods are key to efficient processing of the complex data. One major class is prototype-based algorithms. Prototype-based vector quantizers or classifiers are intuitive approaches realizing the principle of characteristic representatives for data subset...
Cancer informatics by prototype networks in mass spectrometry
OBJECTIVE: Mass spectrometry has become a standard technique to analyze clinical samples in cancer research. The obtained spectrometric measurements reveal a lot of information about the clinical sample at the peptide and protein level. The spectra are high-dimensional and, due to the small number of samples, a sparse coverage of the population is very common. In clinical research the calculation an...
Advanced metric adaptation in Generalized LVQ for classification of mass spectrometry data
Metric adaptation constitutes a powerful approach to improve the performance of prototype-based classification schemes. We apply extensions of Generalized LVQ based on different adaptive distance measures in the domain of clinical proteomics. The Euclidean distance in GLVQ is extended by adaptive relevance vectors and matrices of global or local influence where training follows a stochastic gradi...
Development of Linear Vernier Hybrid Permanent Magnet Machine for Wave Energy Converter
Today, due to the limited supply and rapid consumption of fossil fuels, transitioning towards renewable energy supplies has become more important than ever. The purpose of this paper is to present a new linear permanent magnet vernier machine structure which is designed to capture wave energy and improve the performance of the prototype vernier machine. By halving the proposed vernier machine,...
Publication date: 2006